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ABSTRACT 

Most  naturally  occurring  data  are  serial  in  space  or  time.  With 
randomised  designs,  analysis  which  ignores  the  serial  structure  is  possible. 
When  the  ordering  of  the  data  is  not  at  our  disposal  adequate  models  must  take 
specific  account  of  serial  structure  and  allow  for  error  dependence,  possible 
non-stationarity,  time  trend  removal,  dynamic  relationships  between  variables, 
feedback  between  variables,  and  choice  of  dependent  and  independent 
variables.  Stochastic  difference  equations  supply  a  useful  class  of  serial 
models.  Some  aspects  of  their  structure  are  illustrated  with  practical 


ANATOMY  or  SONS  TUU  SERIES  MODELS 


GEORGE  E.  P.  SOX 


INTRODUCTION 


Many  classical  statistical  models  asauaa  that  data  y1.y2» • • • ,yn  aay  ba  diractly 
raprasentad  by  indapandantly  and  identically  distributed  variables.  Indeed,  soate  of  ua 
have  so  long  been  exposed  to  this  idea,  in  courses,  textbooks,  and  papers  that  ue  almost 
autoaiatically  write  a  relation  for  the  joint  distribution  of  random  variables  y^.yj,  •  • .  ,yn 

as 


p<y1»y2 . y„) 


n 

n  f(y.) 
i-i  1 


(i) 


and  we  almost  believe  it. 

In  particular,  models  are  frequently  employed  of  the  form 

yu  "  fu  *  «u  (2) 

where  yu  la  a  "dependent*  (output)  variables,  fu  is  a  function  of  "Independent"  (input) 

variables  and  paraamtera,  and  eQ  is  an  error.  It  la  then  comon  to  suppose  that 

•u  ’  au  (3) 

where  *u_ i'*u'*ut1' *  *  *  ia  •  sequence  of  random  variables  supposed  independently  and 

A 

identically  distributed  about  zero.  Throughout  this  paper  we  shall  call  such  a  sequence 
"white  noise". 

These  assumptions  are,  of  course,  sweeping  ones.  When  they  are  not  approximately 
true,  results  derived  from  their  use  may  he  very  seriously  in  error.  Alternatively,  to 
take  a  more  positive  view,  the  explicit  modeling  of  serial  dependence  can  provide  answers 
to  many  important  practical  problems  in  forecasting,  feedback  control,  the  estimation  of 
transfer  function  relationships,  and  intervention  analysis. 

In  these  days  statisticians  do  not  ignore  serial  dependence,  however,  there  Is  perhaps 
a  tendency  to  behave  as  if  our  subject  could  be  divided  into  two  pieces  -  one  part 
concerned  with  problems  readily  recognized  as  time  series,  whsre  specific  allowance  is  made 
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for  sar ill  dependence ,  and  tha  other  pert,  consisting  of  everything  else,  where  models  like 
(1),  (2),  and  (3)  Bay  be  aafely  employed.  It  seeaa  doubtful,  however,  whether  any  auch 
diet inct ion  can  be  made. 


UBIQUITY  OP  SERIAL  DEPENDENCE,  NON-STATIONARITY,  AND  NON-HOMOGENEITY 
The  attractive  feature  of  independence  ia  its  mathematical  tractability,  not  its 
direct  association  with  familiar  phenomena  within  our  experience.  The  data  which  we 
mentally  process  in  the  conduct  of  our  daily  lives  is  highly  serially  dependent  not 
independent.  To  see  how  important  this  serial  nature  of  our  experience  is,  imagine  viewing 
the  86,400  separate  frames  of  a  one  hour  movie  film  after  they  had  been  rearranged  in 
randaai  order.  It  is  indeed  the  aerial  dependence  of  our  experience  which  makes  it  possible 
to  conduct  our  lives  in  a  rational  manner.  In  particular,  it  allows  us  continually  to 
project  recent  experience  and  so  to  make  mental  forecasts.  Comparison  of  what  is  expected 
in  the  immediate  future  with  actual  experience  leads  to  appropriate  adjustment  of  ideas  and 
behavior. 

Serial  Dependence  in  Agricultural  Field  Trials 

Although  the  normal  linear  model  with  independently  and  identically  distributed  errorB 
is  often  used  as  a  framework  for  the  analysis  of  agricultural  field  trials  it  is  doubtful 
for  example  whether  R.  A.  Fisher  would  place  any  faith  in  such  an  analysis  unless  the 
design  had  been  randomized.  Only  then,  he  asserted,  could  such  a  set-up  supply  an 

approximation  to  the  randomization  analysis.  There  are  many  reasons  for  randomization  but 

e 

certainly  the  need  to  cope  with  the  serial  dependence  which  would  be  expected  between 
errors  of  adjacent  plots  is  an  important  one. 

The  principles  enunciated  in  R.  A.  Fisher's  book  "The  Design  of  Experiments"  are 
consonant,  not  with  a  world  of  independent  errors  and  homogeneous  experimental  material, 
but  with  inhomogeneity  and  even  non-stationarity.  Thus  the  book  is  careful  to  confine 


Methods  for  analyzing  field  trial  data  which  take  direct  account  of  spatial  correlation 
have  been  discussed,  for  example,  by  Bartlett  (1978). 
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itMlf  to  comparative  experiments  —  not  absolute  experiments  and  the  estimates  of  desired 
contrasts  are  made  from  randomised  comparisons  within  blocks  of  site  eight  or  less ■  Study 
of  uniformity  data  such  as  that  of  Wiebe  (1935),  Figure  1,  shows  the  wisdom  of  such 
principles. 

Serial  Dependence  in  Industrial  Data 

Evidence  that  non-stationary  error  structures  are  also  to  be  expected  for  industrial 
data  is  provided,  for  example,  by  the  devices  employed  in  the  process  industries  for 
feedback  control.  Suppose  Xt  refers  to  the  level,  at  time  t,  of  an  input  variable  that 
can  be  manipulated  to  compensate  for  a  deviation  efc  from  some  desired  output  target.  A 
form  of  feedback  regulation  commonly  found  effective  employs  a  control  equation  having 
"proportional  plus  integral"  terms.  For  discrete  data  it  is  thus  of  the  form 

t 

\  "  Vt  +  k1  £  eu  (4> 

n”0 

where  kQ  and  k^  would  normally  be  of  the  same  sign. 

After  differencing  (4)  becomes 


C(,t',t-11  *  «t  -  9Vl 


(5) 


with 


C  «  (kg+k^ ) 


-1 


®  “  Ck. 


If  the  control  is  effective  then  et  will  be  a  stationary  process  implying  that  Xfc 
follows  a  non-stationary  process.  But  Xfc  is  the  compensation  for  the  disturbance,  which 


consequently  must  also  be  non-stationary. 

It  is  further  possible  to  show  that,  if  the  dynamic  relation  between  input  and  output 


can  be  approximated  by  a  first  order  linear  difference  equation,  then  (4)  can  provide 


minimum  mean  square  error  control  for  a  non-stationary  disturbance  zfc  modeled  by 


Wi 


at  -  6at-1 


(6) 
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YIELD  YIELD  YIELD 


MODELING  SERIAL  PHENOMENA 

Randomization  is  impossible  whan  tha  ordaring  of  tha  data  in  tiaw  or  apaca  la  not  at 
our  disposal .  It  than  becomes  necessary  to  Model  tha  apaclal  character iaitca  of  the  system 
arising  from  its  serial  nature.  These  characteristics  include 
error  dependence  and  possible  non-stationarity 
existence  of  tiae  trends 
dynanic  relationships  between  variables 
feedback 

choice  of  dependent  and  independent  variables • 

In  the  remainder  of  this  paper  we  discuss  the  nature  of  the  models  employed  to  accoseu>date 
these  characteristics.  For  illustration  examples  of  fitted  models  are  given.  However,  our 
purpose  here  is  only  to  discuss  model  structure,  so  details  of  the  model  building  process 
are  not  presented  but  can  be  found  in  accompanying  references. 

Linear  Difference  Equations 

The  crucial  first  step  in  showing  how  linear  difference  equations  might  be  used  in 
modeling  serial  data  was  taken  by  Yule  (1927). 

A  linear  difference  equation  such  as 

yt  “  8yt-1  *  “0Xt-1  +  "lXt-2  (7) 

can,  with  suitable  choices  of  the  coefficients  (S,uQ,u1),  represent  a  dynamic 

relationship  between  an  input  x  and  an  output  y.  Using  B  for  a  backshift  operator  we 

can  write  the  relation  (7) 

(1-«B)yt  *  (w(J+«1B)Pxt  .  (8) 

More  generally  with  6(F)  and  u(B)  finite  polynomials  in  B  any  such  linear  difference 
equation  may  be  written 

5(B)yt  *  w(B)xt  .  (9) 

For  a  stable  system  with  the  zero's  of  6(B)  outside  the  unit  circle  we  can  write  (9)  as  a 
"distributed  lag"  model 
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(10) 


yt  •  ISf  xt '  v(B,xt  ■  h  viB\  -  vjxt-j 

where  v(B)  is  ths  transfer  function  and  { v^}  is  the  impulse  response  function  of  the 
system.  The  impulse  response  weights  thus  determine  the  nature  of  the  linear 

aggregate  which  is  transferred  to  the  output  by  the  dynamic  model. 

Stochastic  Difference  Equations 

Stochastic  difference  equation  models  suppose  that  a  time  series  { z }  can  be 
represented  as  a  realization  of  the  output  from  such  a  dynamic  model  in  which  the  input  is 
white  noise.  Thus 


♦  <B)z  -  8(B)a 


(11) 


where 


♦  .(B)  -  1-$.B-$„B2-...-*  0(B)  -  1-0.B-0.B2-.  ..-8  B^ 

1  i  2  p  l  2  q 


(11a) 


P  12  q 

are  called  autoregressive  and  moving  average  operators,  respectively. 

A  model  of  this  kind  is  often  called  an  autoregressive-moving  average  model  of  order 
(p,q)  or  an  AKMA(p,q)  model,  see  for  example  Box  and  Jenkins  (1976). 

If  the  zero's  of  0(B)  all  lie  outside  the  unit  circle  then  we  can  write  the  model  in 
invertible  form 

♦  (B) 


*t  =  oTbT  zt  =  *<B)*t 


(1-v1B-»2B  «. . .  >zt 


V*t-i 


(12) 


where 


t-1 


l  Vt-i 

i=>1  1  * 


(13) 


Thus,  given  the  model  and  data  up  to  time  t-1  the  conditional  distribution  of  the 

—  2 

next,  but  not  yet  available,  observation  zt  has  mean  zfc_1  and  variance  .  It  is  now 
elementary  to  calculate  also  the  conditional  means  and  variances  for  future  observations 
2,  steps  ahead  and  so  provide  a  basis  for  forecasting. 

As  a  specific  example  consider  the  process  (6)  which  can  be  written 

(1-B)zt  -  (1-0B)at  .  (14) 

Then 


(1-B) 


at  ’  TfiT  *t  “  {1  “  (1-®)(1+9B+82B2+--.))zt 


zt“*t-1 


(15) 


and 


*t_,  -  1-8  l  *\- 
c  1-1 


(16) 


Urns,  for  the  particular  model  (14),  zfc  1  la  an  exponentially  weighted  moving  average 

(ewaa)  of  past  data  ending  at  the  t-1th  observation.  The  conditional  distribution  of 

2 

z.  has  this  ewna  for  its  mean  and  has  variance  o  . 
t  a 

In  the  model  (14),  4(B)  “  1-B  has  a  zero  on  the  unit  circle.  By  allowing  4(B)  in 
the  general  model  to  have  zero's  on  as  well  as  outside  but  not  inside  the  unit  circle  a 
valuable  class  of  models  is  obtained  for  representing  non-stationary  systems.  Also 
seasonal  models  of  period  s  can  often  be  represented  by  using  the  factorization 


4(B)  -  41(B)4g(B  ) 


8(B)  -  ei(B)0a(B  ) 


Some  of  the  issues  are  clarified  if  we  consider  the  problems  arising  when,  as 
occasionally  still  happens,  ordinary  least  squares  is  applied  to  the  analysis  of  time 
series  data. 

For  first  illustration  we  reconsider  some  data  plotted  in  Figure  2  and  first  studied 
by  Coen,  Gome,  and  Kendall  (1969).  These  were  quarterly  data  on 
z1fc:  Financial  Time  Share  Index 
Zjt:  0.  K.  Car  Production 
z3t :  Financial  Times  Commodity  Price  Index. 

The  original  authors  were  Interested  in  predicting  z1  using  lagged  values  of  z2 
and  z3  in  a  linear  regression  equation  which  allowed  for  possible  deterministic  linear 
trends  in  each  of  the  variables. 

With  et  a  random  error,  their  model  can  be  written 

yt  ■  “♦*ot*1*1t4#2*2t**t  (17 


where 


v  -  z 
Yt  It 


(17a) 


It  2, t-6  *  2t  3,t-7 

Denoting  fitted  values  by  y^,  x^,  and  x^,  after  estimation  of  a  supposed  linear  time 

trend  in  each  variable,  this  model  is  of  the  required  postulated  form 


y- y_ 


8,(x,-xit>  +  02<Vx2t’  +  *t 


(18) 
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Column  (a)  of  Table  1  shows  the  results  when  equation  (17),  or  equivalently  (16),  Is  fitted 
by  ordinary  least  squares.  Both  0 ^  and  $2  appear  enormously  significantly  different 


from  zero  leading  to  the 

conclusion  that 

the  stock  price  z^ 

can  usefully  be  predicted 

quarters  in  advance  from 

knowledge  of  Zj 

and  Zj. 

Table  1 

(a) 

(b) 

(c) 

Error  Model 

*t  *  at 

et  “  +«t-1+*t 

•ft-i  "  V0\-i 

*  5 

8,  *  10 

4. 7(0. 4) 

1.8(0. 9) 

1.6(0. 9) 

l2  X  10 

-  6. 1(0.5) 

-  1.9(1. 2) 

-  1.2(1. 2) 

*2 

0 

497 

298 

321 

a 

4  »  0.82(0.10) 

9  -  -0.06(0.15) 

Table  1.  Estimates  of  parameters  with  (standard  error)  for 
the  model  (17) 

(a)  ordinary  least  squares 

(b)  with  4  in  (19)  estimated  simultaneously  from  the  data 

(c)  with  non-statlonary  disturbance  model  (20). 

Modeling  Serial  Dependence  of  Errors 

A  numt .  of  models  which  allowed  for  possible  serial  dependence  of  the  errors  et 
were  fitted  by  Box  and  Newbold  (1970)  using  maximum  likelihood. 

In  particular  the  data  were  consistent  with  a  first  order  autoregressive  model 

et  “  *®t-1  *  *t  ’  (19> 

The  profound  differences  in  inference  that  result  when  4  is  not  forced  to  equal  zero 

(ordinary  least  squares)  but  is  estimated  from  the  data  simultaneously  with  the  regression 

coefficients  may  be  appreciated  from  comparison  of  columns  (a)  and  (b)  in  Table  1  where  it 

will  be  seen  that  the  previous  overwhelming  significance  of  8 ^  and  6^  now  disappears. 
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Of  the  standard  assumptions  that  errors  are  Normally,  Identically  and  Independently 
distributed  it  has  been  the  first  two  that  have  received  most  attention  in  the 
literature.  It  is  a  measure  of  the  importance  of  serial  dependence  that  it  would  require 
very  dramatic  departures  from  normality  and/or  identicality  to  influence  the  estimates  and 
their  standard  errors  in  the  spectacular  manner  seen  in  the  comparison  of  columns  (a)  and 
(b)  in  Table  1. 

Non-Statlonarity  and  the  Elimination  of  Time  Trends 

The  autoregressive  noise  model  efc  =  0.8et_1  +  afc  is  stationary  with  respect  to  time, 
however,  $  =  0.82(0.10)  is  close  to  unity,  for  which  value  the  model  become  non¬ 
stationary.  Notice  also  that  equation  (17)  allows  for  linear  dependence  of  all  the  series 
on  time  with  the  evident  intention  of  eliminating  ( non-stationary)  time  trends. 

Now  there  is  no  a  priori  reason  for  believing  in  the  reality  of  systematic  linear 
components  in  the  three  series.  Indeed  over  longer  stretches  of  time  we  should  expect  each 
series  to  sometimes  trend  up  ards  and  sometimes  downwnrds. 

A  non-stationary  disturbance  model  consistent  with  the  data  is  in  fact  of  the  form  of 

(14) 

( 1  -F)e  -  (l-etOa,.  .  (20) 

t  t 

From  column  (c)  of  Table  1  we  see  that  the  analysis  conducted  with  this  noise  model  gives 
results  similar  to  those  obtained  with  the  autoregressive  model.  In  fact,  in  this  model 
0  is  close  to  zero.  Thus  both  this  non-stationary  model  and  the  autoregressive  model  are 
pointing  to  a  random  walk  structure  for  the  error.  However,  let  us  continue  to  consider 
the  more  general  situation  where  the  noise  model  is  of  the  form  of  (20)  but  0  is  not 
necessarily  close  to  zero. 

We  are  most  familiar  with  the  behavior  and  implications  of  models  for  which  the  errors 
follow  a  white  noise  process  and  ordinary  least  squares  is  appropriate.  A  useful  device  in 
studying  models  with  a  more  complex  noise  structure,  therefore,  is  to  transform  the  model 
to  the  familiar  white  noise  form. 
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From  (15) 


«t  “  *(B)et  -  (21) 

where  *t_!  an  exponential  moving  average  of  the  e's  terminating  at  t-1  with 
smoothing  constant  6 . 

Operating  on  both  sides  of  equation  (IT)  with  x(B)  we  now  obtain 

V^t-i  ■  6Wxit'*i,t-i)  +  ‘V^t'^.t-i’  +  at  122  ’ 

where  for  example  x1t  “  X1  t  ^  is  th®  deviation  of  x1t  from  the  exponentially  smoothed 

value  x.  ..  This  equation  (22)  may  now  be  compared  with  (18). 

1  t  » 

We  seen  then  that  the  fitting  of  model  (17)  with  non-statlonary  noise  (20)  is 
equivalent  to  fitting  the  model  (22)  by  least  ordinary  squares  while  allowing  for 
elimination  of  stochastic  trends  by  exponential  smoothing.  More  generally  the  model  noise 
structure  would  indicate  precisely  what  kind  of  smoothing  should  be  used  for  elimination  of 
stochastic  trend. 

Dynamic  Relationships  Among  Variables 

Suppose,  in  the  share  price  data,  contributions  to  z1t  from  z2t  and  zJt  could  be 
approximated  by  distributed  lag  models 


x,t  -  v12(b)z2t 


x2t  “  v13(B>z3t  ' 


Then  the  lag  structure  assumed  in  (17a)  is  such  that  the  impulse  response  functions 
{v  }  and  {v  }  are  zero  everywhere  except  at  lags  j  *  6  and  j  *  7, 

'*,]  '  'if  J 

respectively,  where  they  take  values  of  unity. 

The  less  restrictive  difference  equation  models  of  the  form  of  (10)  (for  example, 
v12(B)  »  «i» ^ ^ ( B ) /6 ^ 2 < B >  >  are  dearly  more  likely  to  provide  adequate  dynamic  models. 
Notice  that  so  far  as  the  structure  of  the  resulting  models  is  concerned,  after 
transforming  the  errors  to  white  noise  as  before  we  should  have,  correspondina  to  (22): 


Zr*1,t-1  *  ^0+V12<B>  ^Z2t”Z2,t-1  ’  +  V13<BHr3t'Z3,t-1)  +  3t 


(23) 


The  contributions  to  the  response  Zj  and  z  2  and  Zj  would  thus  be  modeled  as  linear 
agregates  of  the  deviations  from  exponentially  smoothed  values  with  weights  supplied  by  the 
impulse  response  functions  of  the  dynamic  systems. 
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Transfer  Function  jjgddi 

In  the  particular  casa  of  tha  share  index  data,  analysis  fails  to  show  any  significant 
transfer.  However,  transfer  function  models  of  this  kind  have  frequently  been  effective. 
For  illustration  Figure  3  shows  hourly  data  from  the  'tain  Rivers  study  of  house  insulation 
(Pollack  and  Shoemaker  1978;  Socolow  1978).  The  estimated  model* 

0.12-M)  42B+0.40B2  24, 

v  -  - 1 _  ..  «.  iU0.26BM1-0.86B.  J. 

yt  1-0. 33B  *t  24  *t  lt  •» 

'  (l-B)d-B  ) 


allows  for  a  highly  non-stationary  seasonal  disturbance  associated  with  the  daily  change  in 
te*q>erature.  After  this  has  been  taken  account  of  the  transfer  function  relation  between 
outside  temperature  and  attic  temperature  can  be  quite  accurately  estimated. 

Particularly  in  engineering  and  environmental  problems,  transfer  function  models,  such 
as  the  above,  have  often  proved  valuable.  Also  the  transfer  function  set-up  with  one  or 
more  indicator  variables  has  provided  a  useful  model  for  intervention  studies  (see  for 
example  Box  and  Tiao  (1975)).  For  some  economic  and  business  data,  however,  results  from 
transfer  function  analysis  between  an  output  yt  and  input  xit'*2t' ‘ ' *  has  aometlmea 
been  disappointing.  In  particular  some  selected  xt  that  seemed  a  priori  to  be  a 
promising  predictive  input  has  in  practice  provided  only  a  weak  or  sometimes  a  non- 
detectlble  relationship.  In  this  connection,  however,  it  must  be  remembered  that  denoting 
the  output  by  yt  an  invertible  time  series  model  of  the  form  of  (11)  may  be  written 

m 

y,  *  I  \y*_,  +  •  (25) 

i«i  1  z  c 

Thus  this  univariate  time  aeries  model  can  employ  the  whole  past  history  of  y  for 
prediction  of  future  values.  Any  useful  xt  must  supply  Information  that  is  additional  to 
this. 


Explanation  of  the  model  is  as  follows.  The  temperature  inside  the  house  is  going  through 
a  large  daily  cycle  which,  using  a  seasonal  noise  model,  could  be  forecast  from  the 
experience  of  previous  days.  The  outside  temperature  xt  suitably  filtered  provides 
additional  current  Information. 
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OUTSIDE  ATTIC 

TEMPERATURE.  X.  TEMPERATURE 


Figure  3 


TIME 


Hourly  data  outside  temperature  and  attic  temperature 

yt  for  an  experimental  house  with  estimated  model. 
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Feedback 


In  any  case  tha  uaafulnass  of  tranafar  function  models  of  tha  kind  conaidarad  abova, 
ia  confinad  to  thosa  examples  where  tha  ralationahip  ia  unldiractional.  In  engineering 
examples,  whan  feedback  occurs,  its  pattern  ia  usually  known  from  the  physical  nature  of 
tha  system  and  so  it  can  ba  modeled  directly  (sea  for  example  Box  and  MacGregor  (1976)). 
However  in  business  and  aconoaiic  systems  the  nature  of  tha  feedback  is  usually  unknown.  In 
the  share  price  example,  for  Instance,  the  model  (17)  tacitly  aasumaa  tha  structure  indi¬ 
cated  in  Figure  4(a),  when  in  fact  any  or  all  of  the  ralationa  in  Figure  4(b)  could  occur. 
Multiple  Time  Series  Models 

Multiple  time  series  models  can  provide,  among  other  thinga,  a  general  framework 

within  which  all  possible  feedback  relations  between  k  aeries  can  occur.  They  thus 

present  possible  means  of  discovering  the  feedback  from  the  data.  In  particular,  a  class 

of  vector  models  may  be  employed  which  parallela  tha  univariate  models  of  Equation  (11)  but 

with  z  and  a^  now  k-dimensional  column  vectors  and  Eta  a* )  “I  .  The  autoregressive 
~t  ~t  -t~t  -a 

and  moving  average  parametera  are  replaced  in  these  models  by  k  x  k  matrices. 

For  example,  a  first  order  autoregressive,  first  order  moving  average  model  for  k  »  2 
series  would  be  of  the  form 


or 


(I-$B)at  -  (I-«B)at 


*  ^  a  eZ  a 

♦  a  ♦  0  a 

♦  9  a 

It 

11  1,t-1 

12  2,t-1 

It  11  1,t-1 

12  2. t-1 

2t 

"  *21*1,1-1 

*  *22*2, t-1 

♦  a  ♦  0  a 

*2t  21*1, c-1 

+  9  a 

22  2, t-1 

°11 

°  12 

*N 

w- 

o 

°22 

(26) 


(27) 


(28) 
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Xn  inverted  fore  such  a  model  may  be  written 


*1t  “  i  *1,t-i  ^  *j  *2,t-j  *1t 

r  (21)  .  r  (22) 

*2t  "  ^  ^  *J  *2,t-j  *2t 


(29) 


In  which  each  itiln  can  "remember*  a  linear  aggregate  of  Its  own  past,  together  with  a 
linear  aggregate  f row  the  past  of  the  other  series. 

When  a  multivariate  model  of  this  kind  is  applied  to  the  share  price  data  (Tiao  and 
Box,  (1981))  a  trivarlate  first  order  autoregressive-first  order  moving  average  model  is 


obtained,  after  appropriate  simplicatlon  the  model  implies  that  the  system  is  approximated 


by 


(1  -  .98B)* 
(.03)  * 


‘it 


(1 


-  . 93B)r 
(.04)k 


2t 


“  .2  +  a 

(.1) 


2t 


(30) 


(1 


•  83B)z  -  2.8  +  .40(1  -  .98B)z  .  +  (1  ♦  .4lB)a 

(.06)  *  (1.1)  (.23)  (.03)  (.12) 


.045 

.024  .085 

.019  .023  . 134J 

It  thus  appeals  that  for  the  share  data  all  three  series  behave  approximately  like 
random  walks  with  slightly  correlated  innovations.  There  is  (weak)  evidence  that  the  share 


price  is  a  leading  indicator  at  lag  1  for  the  commodity  index.  This  relation,  indicated  in 
Figure  4(c),  is  in  the  reverse  direction  to  that  assumed  in  Equation  (17). 

In  general  an  advantage  of  this  approach  is  that  no  prior  assumptions  need  be  made 
about  the  feedback  and  dynamic  interrelationships  between  variables  and  about  what  should 
be  regarded  as  an  independent  variable  and  what  a  dependent  variable.  Rather  we  can  allow 
the  data  with  rather  weak  modeling  to  point  to  the  structure. 


Conclusion 


The  modeling  of  serial  data  is  important  because  we  live  in  a  serially  dependent 
world. 

Considerable  success  has  been  achieved  in  the  last  50  years  or  so  in  Modeling  serial 
dependence  using  stochastic  difference  equations. 

Much,  however,  remains  to  be  done.  Some  important  current  topics  aret 

Parsimonious  parametrization  of  multivariate  time  series  models  (see,  for  example.  Box 
and  Tiao,  1975;  Reinsal,  1983). 

Possible  value  of  non-linear  models  (see,  for  example,  Priestley,  1978). 

Treatment  of  discrepant  observations  (see,  for  example,  Abraham  and  Box,  1979). 
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